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b.  Ipfr^gtjgq 


The  problem  of  biserlal  correlation  arises  wnen  one  is  sampling  from 
a bivariate  normal  population  in  vhich  one  of  the  variables  has  been  dicho- 
tomized, giving  rise  to  only  two  observable  values,  say  0 and  1,  and  one 
wishes  to  use  this  dichotomized  sample  to  estimate,  or  test  hypotheses  con- 
cerning the  correlation  coefficient  P of  the  original  bivariate  normal  dis- 
tribution. P is  sometimes  given  the  name  biserial  correlation  coefficient. 
This  name  reflects  the  former  confusion  between  sample  statistics  and  popu- 
lation parameters,  referring  of  course  to  the  fact  that  a sample  drawn  from 
the  observable  bivariate  population  just  described  may  be  thought  of  as  two 
separate  series  of  observations,  those  in  which  the  dichotomized  variable 
has  the  value  0 and  those  in  which  it  has  the  value  1.  It  is  apparent  that 
the  numbers  of  observations  in  the  two  series  are  dependent  binomial  random 
variables  whose  sum  is  the  sample  size. 

The  term  biserial  correlation  was  introduced  in  1909  by  Karl  Pearson 
[6],  who  was  the  first  to  perceive  the  statistical  importance  of  this  par- 
ticular type  of  problem.  He  proposed  as  an  estimator  the  sample  biserlal 
correlation  coefficient.  The  asymptotic  variance  of  this  estimator  was  de- 
rived in  1913  by  Soper  ftO . Since  that  time,  with  the  exception  of  certain 
references  to  discriminatory  analysis  (see  fj-3]) , in  which  the  use  of  the 
sample  biserial  correlation  coefficient  goes  unquestioned,  no  results  of  a 
mathematical  nature  were  contributed  until  a recent  paper  by  Maritz  [$], 

Much  literature  exists,  however,  on  the  subject  of  how  best  to  compute  Pear- 
son's coefficient.  In  this  connection  the  reader  should  see  DuBois  f 2], 
Dunlap  [3j,  ami  Royer  [7J. 

The  problem  of  biserial  correlation  occurs  quite  often  in  psychological 
woxic,  especially  in  that  branch  of  the  subject  known  as  Test  Construction 
end  Validation.  In  connection  with  an  objective  test  one  may  be  interested 
in  obtaining  a measure  of  the  strength  of  the  relation  between  the  ability 
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to  answer  correctly  a particular  item  in  the  test  and  ability  to  perform  at 
some  task,  or  in  testing  hypotheses  concerning  the  strength  of  this  relation. 

Such  a measure,  together  with  a r andean  sample  of  individuals,  selected  of 
course  from  the  population  for  which  Inferences  are  to  be  drawn,  helps  to 
determine  whether  or  not  the  given  item  should  be  included  in  the  test.  In 
order  to  put  the  problem  in  the  proper  fora  certain  assumptions  must  be  made. 
Suppose  that  the  ability  to  answer  the  test  item  correctly  can  be  represented 
numerically  by  a random  variable  with  a normal  probability  distribution, 
which  however  cannot  be  observed  due  to  the  restrictions  of  the  test;  in  par- 
ticular, suppose  that  observations  on  this  ability  take  the  value  0 if  the 
question  is  answered  incorrectly,  and  the  value  1 if  the  question  is  answered 
correctly.  If  in  addition  to  the  underlying  normal  distribution  just  postu- 
lated, we  assume  that  the  ability  to  perform  at  the  task  is  aleo  measurable 

* 

with  a normal  distribution,  and  that  the  two  normal  variables  have  correlation 
coefficient  P,  then  ve  have  the  problem  of  biserial  correlation.1  A simple 
example  would  occur  when  a true-false  question  is  included  among  the  questions 
in  a preliminary  college  aptitude  teet  and  then  a follow-up  study  is  conducted 
on  the  sampled  students  in  order  to  observe  their  final  grade  point  average 
upon  graduation  four  years  later.  In  this  case  P would  represent  the  degree 
of  association  between  ability  to  answer  the  question  correctly  and  ability 
in  school,  and  would  be  estimated  by  some  function  of  the  paired  observations 
on  the  sample  students.  It  should  be  notbd  that  in  such  a case  acceptable 

e 

test  items  are  those  for  which  \ P|  is  judged  to  be  near  1.  Ve  shall  refer 
to  this  example  several  times  in  the  sequel  in  order  to  Illustrate  certain  points. 
^Pearson's  original  formulation  was  less  restrictive  than  that  given  here. 
Anticipating  the  fact  that  problems  in  estimation  and  testing  hypotheses 
will  require  assumptions  of  normality,  we  make  these  assumptions  at  the 
outset. 
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In  the  above  example  it  is  assumed  that  there  exists  an  underlying 
distribution  of  ability  to  answer  the  true-false  question  correctly,  and 
that  in  addition  this  distribution  is  nonaal.  Since  it  is  not  possible 
to  observe  the  underlying  distribution,  it  also  is  not  possible  to  test 
the  assumption  of  underlying  normality,  but  only  that  the  underlying  dis- 
tribution is  normal  given  that  there  is  an  underlying  continuous  distri- 
bution. Therefore,  in  many  situations  in  which  biserial  correlation  is 
appealed  to,  the  assumptions  involved  are  open  to  serious  attack,  an 
attack  for  which  there  is  no  adequate  statistical  defense. 

If  one  wishes,  he  may  give  up  the  assumption  of  underlying  normality 
and  assume  instead  that  the  observed  bivariate  distribution  is  that  of  a 
discrete  random  variable  which  takes  the  values  0 and  1,  and  a continuous 
random  variable  with  the  property  that  its  conditional  distribution  be 
normal  for  each  given  value  of  the  discrete  variable.  With  this  formu- 
lation good  results  may  be  obtained  if  one  is  willing  to  make  the  assump- 
tion that  the  two  normal  conditional  distributions  have  the  same  variance. 
The  problem  of  estimating  or  testing  hypotheses  concerning  P under  this 
set  of  conditions  is  known  as  the  problem  of  point-biserial  correlation 
and  is  treated  in  reference  (loj.  Some  of  the  results  obtained  there  are 
similar  to  those  of  the  present  paper.  Note  that  this  set  of  conditions 
may  be  tested  statistically  by  testing  the  normality  of  the  conditional 
distributions.  Moreover,  no  confusion  between  the  two  model's  would  occur, 
since  in  the  case  of  biserial  correlation,  it  is  easily  shown  to  be  im- 
possible for  the  conditional  distributions  to  be  normal.  In  the  problem  of 
estimating  the  correlation  between  ability  to  answer  a test  item  correctly 
end  ability  in  school,  the  use  of  point-biserial  correlation  requires  that 
(i)  for  each  student  who  answers  the  test  item  correctly,  the  conditional 
distribution  of  his  grade-point  average  be  normal  with,  say,  mean  /£♦, 
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and  variance  cr  j , and  that  (ii)  for  each  student  who  answers  thu  test  item 
incorrectly,  the  conditional  distribution  of  hie  grade-point  average  be 
normal  with  mean  S' o and  variance  c r^.  Note  that  it  is  necessary  for  the 
variability  of  grade-point  average  to  be  the  same  for  the  two  groups  of 
students. 

Professor  Harold  Hotelling  realised  some  years  ago  that  the  existing 
methods  for  dealing  with  the  problem  of  biserial  correlation  were  far  from 
satisfactory,  and  suggested  to  tie  author  that  the  whole  situation  be  recon- 
sidered. The  results  of  this  examination  are  contained  in  the  present  paper. 

Section  2 contains  a list  of  most  of  the  notation  which  has  been  adopted, 
and  Section  3 deals  with  the  mathematical  model. 

In  Section  4 the  question  of  maximum  likelihood  is  treated.  The 
maximum  likelihood  estimator  is  3hovn  to  be  asymptotically  normal  and 

A A 

asymptotically  efficient.  The  asymptotic  variances  for  r and  , the 
maximum  likelihood  estimators  of  the  correlation  P and  point  of  dichotomy 
&>,  are  found  by  the  usual  method  which  employs  the  information  matrix  and 
side-steps  the  solution  of  the  likelihood  equations.  A valuable  contribution 
to  the  theory  of  biserial  correlation  was  made  by  Marita  [5J.  Comments  are 
made  on  his  work,  and  on  a paper  of  Tocher  QjJ , in  the  early  part  of  Section  4. 

An  evaluation  of  x*  , the  sample  biserial  correlation  coefficient,  is 
given  in  detail  in  Section  5.  It  is  shown  that  r*  has  asymptotic  efficiency 
for  estimating  P which  is  1 whin  P * 0,  but  which  approaches  0 when  l f\ 
approaches  1.  Consistency  of  r*  was  shown  by  Karl  Pearson  [6J.  The  well- 
known  fact  that  r*  may  be  greater  than  1 is  pointed  out  and  some  notion  of 
the  magnitude  of  r is  obtained  by  a consideration  of  the  product  moment 
correlation  coefficient  r.  Asymptotic  normality  of  r*  is  verified  by  the 
use  of  a theorem  of  Cramer.  The  asymptotic  standard  deviation  of  r is 
tabulated  in  Table  I at  the  end  of  the  paper.  One  interesting  point  in 
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Section  5 ie  an  intuitively  appealing  fact  which  the  author  discovered  is 
universally  assumed,  but  apparently  was  never  proved:  rudely*  ttia*,  the 
asymptotic  variance  of  r*  is  a minimum  at  w»c  for  each  .fixed  p , A 
proof  is  given  for  this  result.  For  the  case  vu  *0  an  approximate  variance 
stabilising  transformation  is  derived.  Calculations  pertaining  to  this  trans- 
formation may  be  carried  out  by  using  Table  VB  of  Fisher  (&]  for  the  function 
tanh**-^  (r).  This  result  should  prove  useful  in  many  situations.  Section  5 
concludes  with  a discussion  of  the  preceding  results  and  of  the  feasibility 
of  using  r*  to  test  the  hypothesis  H*  /9*rfo  when  i?o I is  small. 

Section  6 is  devoted  to  a discussion  of  an  iterative  method  of  solution 
for  the  likelih:  ; luationa.  Tba  method  is  essentially  Bewton*  e method  for 
two  variables,  the  calculated  values  u>  , r being  used  to  start  the  iter- 
ation. The  computations  are  not  really  prohibitive,  considering  the  import-  ’ 
ance  of  the  problem,  and  are  to  a certain  extent  organizable  for  punched- 
cards  methods.  An  example  is  given  with  all  of  the  calculation  illustrated. 
The  data  consist  of  a sample  of  20  observations  taken  from  an  artificially 
constructed  bivariate  normal  population  with  f-  .707. 

Values  of  Mills’  ratio  , 


are  required  for  the  cal aula  lions . For  purposes  here  we  shall  need  a table 
whi  .1  gives  <f(x)  for  x ranging  from  -3  to  * 3 in  steps  of  .01,  so  that 
no  transformation  or  extensive  interpolation  is  required,  since  it  will  be 
necessary  to  obtr  . n.  values  of  ^ for  a given  problem  in  which  a sample  of 
n has  been  taksu.  Accordingly,  we  include  Table  II  at  the  end  of  the  paper. 
Interpolation  by  inspection  in  Table  II  should  be  quite  satisfactory. 

Hie  subject  of  biserial  correlation  is  generally  given  a light  treat- 
ment in  texts  on  psychological  statistics,  centering  on  the  unboundedness 
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of  v*  and  the  questionable  character  of  the  assumption  of  underlying  nor- 
mality for  the  dichotomized  variable.  A notable  exception  is  the  recent  book 
by  Valker  and  L.v  (12],  which  has  a more  complete  treatment  of  the  subject. 

A few  of  the  results  contained  ii  the  present  paper  are  referred  to,  and  illu- 
strated in  £12]. 


2.  Notation 

To  eliminate  the  distractioi  of  searching  through  the  text,  we  shall 
list  here  most  of  the  symbols  and  notations!  devices  used. 


fOwr) 

p(x) 

Hx) 

5 (*»«*») 

j 

i 

•>» 

z 


. 2(i- 


2tr 


(x2  - 2f>  ay  - y2) 

» 


the  bivariate 
normal  density 


the  normal  density 


«o 

J^CtJdt,  q(x)  * 1 - p(x) . 

x 


Alii  , Mills1  ratio. 
p(x) 


I f <*,y)dy 


I 


the  undichotonisod  normal  random  variable. 

the  dichotomised  normal  random  variable. 

the  point  of  dichotomy  of  I,  measured  in  standard  units. 

the  discrete  random  variable  induced  by  the  dicho  tool  action  of  X. 
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the  joint  density  of  the  random  variables  X and  Z. 
the  correlation  coefficient  of  the  random  variables  X and  Y. 
the  maximum  likelihood  estimator  of  f . 
the  sample  Mserial  correlation  coefficient, 
the  ordinary  sample  correlation  coefficient  based  on  the 

t 

sample  (X^,Z^)  ^ i — l,2,.^.,n« 
the  asymptotic  variance  of  r*. 

the  asymptotic  efficiency  of  r*  for  estimating  f • 

a normal  random  variable  with  mean  [*-  and  variance 

a bivariate  norma'  random  vector  with  means  f*- , , 

o o 

variances  <r  , t , and  correlation 

owaotes  the  fact  that  Un  is  asymptotically  normal  with 
mean  f*~  and  variance  <r"^. 

3.  MathsmaUcal  Model 
Let  (X,I)*&>f  and  60  be  any  fixed  constant.  How  let 

Z be  a Bernoulli  random  variable  defined  as  follows! 

(3.1)  2-1  if  w and  Z«0  if  * 

Obviously, 

04> 

P(2  “1)  * f A (*=$  dy  =pM,  P(Z  * 0)  * qH. 

Consider  e (ample  of  n independent  random  vectors  (X^Z^,  (X2,Z2),..., 
(*nt2n)»  The  problem  of  biserial  correlation  consists  in  finding  a suitable 
function  or  (1^7 J i *l,2,...,n,  with  which  to  estimate  f . 

Karl  Pearson  C6]  introduced  the  estimator  r*  (■Mee-^al  r*) , which  we 
express  in  the  following  forms 


f(x,s) 

r 

r* 


V(r*}«,/») 

£(r*|(s,f) 

(0^(^V,r2,tr2,f) 
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(3,) 

fifc.  <*!•*>*  j 4 *<« 


2ii 


{*  5»a£l 

X(T 


u 

where  le  the  product  moment  correlation  coefficient  of  (X^ ,Z^) , and 
T is  tbs  solution  of  the  equation  • • 


(3.3) 


•© 

j\(y)dy 


r*  will  be  discussed  completely  in  Section  5.  For  the  present  we  shall 
merely  state  the  asymptotic  variance  obtained  by  Soper  f8]r 

(3.4)  V(rV.)  *-£[p4»p2  ’ 

where  the  functions  p,  q,  and  ^ all  have  argument  <*>  u . 

I* 


/n  * air*)  * jn  V(r*lv,p)j 


is  given  in  Table  I at  the  end  of  the  paper.  In  view  of  syuetry  about  the 

values  p » 0 <aid  -p  («>)  * the  tabulation  is  given  for  p * 0 to  1 

in  the  steps  of  10,  and  for  p * .05  ta  .50  in  steps  of  .05. 

* • 

Sinoe  the  random  variable  Z takes  the  value  0 or  1,  the  joint  density 

e 

of  (X,Z)  can  be  written 

• i* 

(3.5)  f(x,s)  * *f(x,l)  r (l-s)f(x,0), 
where 

«/E>P 

(3.6)  f(x,C)~  J y(x,y)dy,  f(x,l)j 


frf(x,  y)dy. 


with  Y (x,y)  denoting  the  density  of  t:2jp).  Sections  4 and  6 

are  devoted  to  a discussion  of  the  likelihood  function, 

n 

i*l 


(3.7)  L = ] ] jU-s^f^O) + Sjf!  (xi*10 


9 


4.  PronarttM  aL  the  Maximum  LikaJLlbOOtii  E§tiMto» 

It  may  be  seen  that  L is  actually  independent  of  ^ and  “C , since  a 
change  of  variable  y'  * (y-*>)/t?  in  the  Integrals  of  f(x,0)  and  f(x,l) 
removes  ^ and  £.  Hence,  in  all  further  work  we  shall  set  ^ » 0 and 

r = 1. 

The  main  stumbling  block  is  the  existence  of  the  nuisance  parameters 
jjl  and  a-?.  The  4 likelihood  equations  in  the  variables  <4  P,  /*,  are  not 
hard  to  write  down,  but  the  algebraic  difficulties  involved  in  the  deviation 
of  asymptotic  variances  and  covariances,  and  the  numerical  difficulties 
involved  in  solving  the  equations  by  an  iterative  method,  prove  far  too  pro- 
hibitive. It  seems  intuitively  clear  that  were  we  able  to  solve  the  4 likeli- 
hood equations  for  uj,  p,  p,  $-2,  we  would  find  that  p » p(x1,x2,...,xnJ 

* * ' ,zn)  is  invariant  under  any  transformation  of  the  form  xj/*axj+b. 
This,  of  course,  does  not  give  us  the  right  to  set  /<  ■ 0 and  £ a*  1,  and 
then  expect  to  find  the  correct  maximum  likelihood  estimators  for  and 
The  following  course  of  action  has  been  adopted  as  a way  out*  set  and 

* 1.  Then  solve  the  2 likelihood  equations,  and  in  the  resulting  solution 
replace  x^  by 

U.n 

If  we  know  the  values  of  ft  and  cr>  the  problem  is  naturally  avoided  by  an 
immediate  transformation  of  the  original  data.  In  all  future  work  we  shall 
assume  *0  and  • 1.  ^f(x,y)  now  becomes  V*(x,y),  and  the  likelihood 

function  takes  the  form 

k(w»f)  - ) I j (*!»<*>)  t (1-^)  ? (x±,U^  ^ . 


(4.2) 
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We  shall  pause  at  this  point  to  discuss  the  work  of  Maritz  £jj • Using 
Probit  Analysis,  together  with  a result  of  Tocher  Q-fl » he  has  given  a very 
nice  approximation  scheme  for  the  solution  of  the  likelihood  equations.  In 
what  follows  we  shall  give  a short  outline  of  his  method  in  terms  of  the 
notation  of  the  present  paper. 

Let  (X,I)»$-V'(0,Ojl,(l-^)~Sp).  In  view  of  the  fact  that  the  likeli- 
hood equations  are  invariant  under  a change  of  V or  r 2,  this  formu- 
lation is  equivalent  to  ours.  Now  introduce  a grouping  of  the  observations 
•|X^;i  =-l,2,...,n}  with  a set  of  k cells  of  equal  width.  Denote  this  group- 
ing by 

=»l,2,...,kj. 

Denote  the  collection  of  midpoints  of  these  cells  by 

* l*2,#..,kj. 

Let 

(4.3)  PJ>k»P(ieiJ>k),  irjtk-P(2-l|x-Sj>k) 

Now  let  Nj>lc  be  the  number  of  observations  in  the  sample jx^i  **  l,2,...,nj 
which  fall  in  1^^,  and  Mj  ^ be  the  number  among  the  Nj  ^ for  which 
the  corresponding  Z observation  is  1.  Thus,  » 1,2, ...,kj  have  a 

multinomial  distribution  with  parameters  |Pj >Jc; j »l,2,...,kj.  The  condit- 
ional distribution  of  — l,2,...,k}  given  •»  n^  k is  the  pro- 

duct of  k binomial  distributions. 

(4«4)  Ht  y*  n p Pj  v s ^ — 1»2,...) 

j-1  >1 

Maritz  now  assumes  that  the  observations  i — 1,2, ...,nj  are  conoan- 

trated at  their  respective  cell  mid  points.  Since  the  marginal  distribution 
of  the  is  independent  of  u>  and  f , the  part  of  the  likelihood  function 
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which  depends  on  f*  and  oJ  will  be  the  conditional  distribution  of  the  ^ 

given  Nj  jj,*5  n^k,  which  has  parameters  Ve  are  thus  ultimately  led 

/ /\ 

to  two  simultaneous  equations  for  and  f which  contain  terms 


iEfUk 

* 


* 


These  equations  are  then  transformed  slightly  and  probit  analysis  is  used  for 
the  solution. 

Presumably,  as  the  grouping  becomes  finer,  the  estimates 
19m2fw»t\fk)  and  f the  asymp- 

totic variances  o\2  and  c-"|  , will  approach  the  correct  values  for  the 
original  problem.  A proof  of  this  result  must  depend  on  a close  examination 
of  the  limiting  processes  involved.  The  situation  which  arises  may  be  described 
as  follows.  Ve  assume  that  the  grouping  becomes  finer  and  we  wish  to  assert 
tn>  things: 


(4.5) 


**  » P(Z  * ill  • x^)ji  * l»2,...,nj  ^ 

~l,2,...,k  j *1,2 ,...,kj 

j #k 


in  the  sense  of  probability. 

The  meaning  of  (4.5)  is,  then,  that  as  k and  n both  become  large  the  cell  width 
must  became  small,  but  in  such  a way  that  each  cell  still  contains  suffici- 
ently many  observations  for  TTj  to  be  a valid  approximation  of  Mj  jt” 

This  result  does  indeed  appear  quite  plausible,  but  a detailed  proof  would  be 
lengthy. 

Instead  of  attempting  a discussion  of  the  above  point,  we  offer  an  alter- 
native derivation  of  the  asymptotic  variances  of  P and  u)  in  this  section, 
and  In  Section  6 and  iterative  scheme  for  obtaining  (pfi)  which,  while  more 
time  consuming  than  that  of  Marita,  does  not  require  any  grouping.  It  should 
be  noted  here  that  Tocher's  sxact  method  (pp.  9-11,  [llj ) , also  known  as  the 


■scoring"  method,  doesn't  help  ir.  this  case,  owing  to  the  difficulty  in 
obtaining  the  expectations  of  the  second  partial  derivatives  of  L.  Again, 
the  plausibility  of  Marita*  scheme  should  be  emphasised. 

Many  results  have  been  obtained  concerning  the  asymptotic  normality 
and  asymptotic  efficiency  of  maximum  likelihood  estimators.  In  each  case 
the  parametric  family  of  probability  distributions  is  sub j tot  to  certain 
regularity  conditions,  The  density  of  (X,Z)  is  f(x,z).  Ve  shall  not  dwell 
here  at  any  length  on  the  regularity  conditions,  but  shall  merexv  remark 
that  the  regularity  conditions  given  by  Cramer  (Chap.  33.3,  D3)  may  be 
easily  verified,  since  f(x,0)  and  f(x,l)  are  both  integrals  of  bivariate  nor- 
mal densities.  Consequently,  will  be  asymptotically  normal,  and 

asymptotically  efficient  estimators  for  u>  and  f respectively . Asymptotic 
variances  of  the  maximum  likelihood  estimators  may  be  found  by  an  inversion 
of  tha  inverse  matrix  without  actually  solving  the  likelihood  equations, 

Ve  now  use  this  technique. 


Iftgprgft  I- 

A 

The  asymptotic  variant.  of  f is  given  by 


(4.6)  V(f  (l-p*)3.  I jx^XjSfJdx 

•a  00 


where 


( £*g(x,u>,f)dxj2  . 

--yar f 

/ g(x*>,f)dx  -1 


g(x,o;,{»)  = A(x)  j>  (~  r j 

proof: 

log  L « j^log  -|*i  ^ (xif^)  T Q--*±)  7 (*1**)}  * 


Ve  will  need  the  quantities 

{&)■  ■( 


log.  \ 
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2 

Letting-  & refer  to  any  of  the  three  second  order  partial  operators,  we 
have  the  fundamental  relation 

(4.7)  E^og  Lj  * nqE^&og)  (X,u>)}  * npE^log  £(!,<*>)}, 

where  Eq  means  expectation  with  respect  to  the  conditional  density  of  X 
given  Y < <ot  and  E1  means  expectation  with  respect  to  the  conditional 
density  of  X given  Y * OJ.  The  conditional  densities  are 

(a 

(4.8)  V(x|Y<w)i  A f f(x, y)dy  * 

* q r 

09 

(4.9)  f(xlYi«u)  -•  A /?(x,y)dy  — A £(x,u»). 

P Zj  P ' 

2 

For  each  of  the  possible  operators  b the  calculation  of  (4.7)  pro- 
cedes  in  about  the  same  way.  As  an  illustration,  we  shall  compute 


For  a random  variable  U with  density  h(u;0),  it  is  well  known  that 

uao,  , . £ ((^f  (Jmzlfj  , 

provided  the  expectations  exist  and  differentiation  twice  under  the  expec- 
tation sign  is  peraissible.  It  is  easily  seen  that 

(4.u)  * V (**>). 

Consider  the  first  term  in  the  right  member  of  (4.7) . From  the  definition 

of  £q  and  (4.8),  together  with  (4.10),  we  have  the  result 

+00 

(4.12)  aqE0  * - n J dx. 


In  a similar  manner  we  show  that 


(4.13) 


Hence,  combining  (4*12)  and  (4.13) , we  get 

4.« 

/.  , *<<a2laffL7  - _ f>(») 


(4.14)  MW1!-  <*• 


Doing  the  relatione. 


'{YM J2  ■» 


$(x,“>)  = 


f(x,u>) 


oo  ' 

A(x)  J"  A(y)dy, 

•£i£L 

rffp- 

\(*)  J A(y)dy,  ' 


we  hare  from  (4.14)  and  the  definition  of  g (x,mf>) 


(4.15) 


similarly, 


1 1*1^1=- 


TZf) 


+ O0 

j"  g(x,t^f>)dx 


(4.!6)  E { ~ / (x-f^)2  g(x,«tf)dx, 


(4.17) 


d-f2)*  j 


j (*-('•»)  g(x, «/,(")«*. 


Forming  the  2x2  information  matrix 


A= 


£ ^figjL-k  E ^2j^E-k 


and  observing  that  j A.|  is  non-vanishing  because  of  the  Schvars  inequality. 


we  finally  obtain 


(4.18) 


W = -A-1  = ^ 

22  _oo  J gdx  * 

(«•«■ -k-^{  r--^r . 

v v -oo  r 2 , J 

J X gdx 
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v-1  th  A-i 

where  A^j  ie  the  lj  element  of  the  Inverse  matrix  , which  proves 

Theorem  I . Expressions  (4.18)  coincide  with  the  previously  mentioned 

results  of  Maritz. 


5.  The  Utility  of  r» 

Ve  shall  now  present  a series  of  results  concerning  x*,  whi  ch  will  be 
followed  by  a general  discussion  of  its  value. 

A little  later  ve  will  need  E(XZ).  Since,  it  is  not  difficult  to 
obtain,  we  will  give  the  expression  for  the  general  moment  of  E(X*Z) . 

IX 

TfcflgrfiB  &: 

= h (j)  (1-f2)7  (* | yK'j  e"  T dy» 

j*  O tjj 


where  a^  is  the  moment  of  the  random  variable  ./(0,1) . 

Proofs  Using  the  definition  of  E^  and  (4.9) , ve  obtain 


E(XkZ)  :=  ^(xK)  - f | (x,y)dydx. 

-00  UJ 

Make  the  transformation  t « (x-^y)  /l-f2.  The  above  then  reduces  to 


**o  e* 


••0  w 


s/5tr 


v/2T 


dydt. 


Using  a binomial  expansion  and  integrating  with  respect  to  t,  we  have 
Theorem  II.  The  integrals  contained  in  Theorem  II  may  be  evaluated  by  a 
recursion  relation.  Let 


<w 


Then, 

b^M  * (✓-l)  b^2  (<J)  tuA1  b^H. 

We  now  easily  arrive  at 

(5.1)  b0H  sr  p(w),  bjM  * >(o>), 
whence 

(5.2)  3iQ  s p(v),  ^ * p>(4i),*2  v pfc-0  -r<^^)p2. 

The  relation  between  P(X,X)  and  p(X,Z)  is  given  by 

TtomI£L:  p(X,Z)  * p(X,Y) 

Proofi  Let  EX  * /<■  * 0 and  V(X)  *”  cr^  * 1.  Then, 

f(X,Z)  * (EXZ)  [V(*>]-*  » (EXZ)  (pq)^.  From  ,5.2),  of  L * p A M, 
which  proves  the  theorem. 

It  follows  from  the  original  definition  of  biserial  correlation,  as  given 
by  Pearson  [6],  that  r*  is  consistent.  This  fact  is  also  an  immediate  con- 
sequence of  relation  (3.2)  between  r*  and  ri  r— »p(X, Z)  in  probability 
as  n — > . Thus, 

* * -&t)  (zi  - 5)'  )'i->£iWs  10  rrobabillty 

and  hence  by  Theorem  III,  r*  — in  probability. 
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With  respect  to  the  magnitude  of  r*,  it  is  veil  known  that  J r*/  can 
be  greater  than  1.  Something  of  the  nature  of  {his  phenomenon  can  be  under- 
stood by  looking  at  r.  In  order  to  prove  a result  concerning  the  magnitude  of 
of  r*,  ve  shall  need  a result  from  reference  T9]* 

Theorem  IV  (Lemma  2 of  C9J): 

p(x)  q(x)  £ JjL  A2(x),  (-«®<x  * «■«) 

Nov  ve  have 


Theorem  X* 


£ 

r 


Proof* 


Rewriting  (3.2)  as 


r*  • r 


> 


ve  have,  in  view  of  the  definition  of  T, 

r*-  r . 

AW 

Theorem  IV  applies  for  any  T so  Theorem  V is  proved.  As  a consequence  of 
Theorem  V,  ve  see  that 

>1  > 

i*  * . according  as  r ' 

<-1  < 

Asymptotic  normality  of  r*,  which  will  be  needed  later  in  this  section 

* 

is  a trivial  consequence  of  a theorem  of  Cramer. 


Uaeasga a»  r*~  p)). 

Prooft  In  expression  (3.2)  the  tern  ,A(T)  is  seen  to  be  a continuous  function 
of  Z.  Thus,  r*  is  a continuous  function  of  the  sample  means  2,  Z,  X%,  XZ.( 
Applying  Cramex^s  theorem  (p.  366  of  Q.J),  we  have  asymptotic  normality  with 
the  asymptotic  variance  (3.4)  calculated  by  Soper  [8]. 

Ve  shall  now  present  two  results  which  are  more  important  than  those  just 
preceding.  They  concern  the  asymptotic,  or  large- sample , efficiency  of  r* . 

e 

Theorem  VII:  x*  is  an  asymptotically  most  efficient  estimator  of  f when 

f “0- 

Prooft  In  view  of  Theorem  VI  on  asymptotic  normality,  we  have  a right  to 
inquire  about  the  asymptotic  efficiency  of  r*,  which  will  be  denoted  by 

• 

5 (r*Kf)  “ TMKf)  • 

It  may  be  seen  from  Theorem  I,  (4.6),  that 


(5.3)  V^KO)  = aMsM  . 

a[M*)J2 

Sow,  from  (3.4)  we  observe  that  (5.3)  coincides  with  V(r*ju>,0).  The  con- 
clusion follows  from  the  definition  of  an  asymptotically  most  efficient  esti- 
mator.” 

Theorem  r*  is  an  asymptotically  least  efficient  estimator  of  f*  • when 


Ifl-H. 

Proof:  An  application  of  Theorem  IV  shows  that 
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Hence,  recalling  the  definition  of  g(x^*J,P)  in  Theorem  I,  we  see  that 

all  integrals  of  the  form  J x£  g(x,o?,f)dx  exist.  Schwarz’  inequality 
a -oc 

shows  that  V ( is  such  that  the  terms  in  braces  is  non- vanishing. 

Thus,  V^U^f)  0 as  IP)-^1.  From  the  fact  V(r*/u.!,f)  -»  Z. 
as  Ifl->1,  we  conclude  that 

The  special  case  CO  sO  has  interesting  features  which  will  appear 
in  Theorems  X and  XI.  First  we  shall  need  another  result  from  reference  , 


Theorem  g (Lemma  1 of  [9]) 

{l-2p(x)}  A (*)  - xp(x)q(x).  2.  0,  x £ 0. 

. • 

$ 

The  asymptotic  variance  of  r*  has  its  minimum  for  each  f3  at  a q„ 
Proof:  We  must  show  ® 

V(r*Hf)  V(r*|0,f) 

for  each  f , In  view-  of  symmetry,  it  will  be  sufficient  to  show  this  for 
caJ-^0.  Let, 

• ita)  = a - “ibaefeii  , bm  = 

• [AM]5  .am 

• • 

g(to)  =•  {l-Jp^)]  /(w)  -o>p(w)q(w)  # h(tv)  sss  p(u>)q(tt>)  -TTp(<M)J2  /2, 

* * 

From  this  point  until  the  end  of  the  proof,  we  shall  omit  «*>  whenever  it 
appears  as  an  argument  of  any  function.  ‘From  reference  f9}  we  have 


r 


g'  «2X2  - pq,  g«=  -4  - (2q-lM  » g(0)  — g(«)  * 0,  g’(0)p>  0, 


h*  »A(l-aq)  ♦ miu2,  hM»  A 2(rr-2-2iiw2)  -*>(i-2q)A  , 

h(0)  — h(oo)  - h' (0)  * 0,  h*  (0)  > 0. 

Accordlnglyi  v#  have  A ^ - 6.g  A “2 , B =•  h)T2  +■  7r/2,  with  k < 0t 
B £.  1/2,  both  equalities  holding  at  Toe  relation  VCr^oy*)  £ 

?(r#/ O^5)  for  all  may  be  written^A  *■  b 9^2  for  all  ^ . Since 
A 4,  0,  this  iaet  expression  is  implied  by  A + t i 2.  Tt/<^,  trtuch  in  turn 
is  equivalent  to  h ^ <-Og,  Thus,  we  must  snow  k --  h -tOg  ii), 
k'  2wq(l-q)  - 2(2q-iM +w(rr-2)  A2, 

(5.4)  k-  * 2q(l-q)  - A2  {6  -it  f«2  (2|P-  ;.)}  , 

• k{.0)  ~k(*>)  = k*(0)  = 0,  k*  (0)  = i.  - yfa  > t-' 

We  shall  show  that  there  exists  no  y such  such  k*(y)  — 0,  k(y)  < 0. 
Suppose  such  a y does  exist.  Then, 

(5.5)  2(2q-l)A  = 2yq(l-q)  + (ir-2)y  A2> 

q(  i—q)  ( i+y2)  - wA2/ 2 - y(2q-i)A  0. 

Substituting  the  right  member  of  the  first  expression  into  the  second,  we 
have  2q(l-q)  < >2{t T+  (T- 2)  y2}.  Thus,  k"  (y)  < A2  {2*^-6  -(ir-2)y*i. 

A negative  maximum  must,  from  (5.4 ),  be  followed  by  e negative  minimum. 
Hence,  from  the  above  relation  in  k*(y) , there  exist  no  extrema  which 
exceed  ^(2fM>)/(r-2)}$.  Assuming  there  is  a negative  extremum  of  k,  then 
there  must  be  a negative  minimum  in  (0,1)  • let  y be  this  minimum  point. 
Then  k*(y)^  0,  or  from  (5.4),  2q(l-q)  ->*  ^6-tT-  12T-4)  72}>0. 

Substituting  the  volve  of  2q(l-q)  obtained  from  the  first  equation  in  (5.5), 
wa  reach  (2q-l)  - yA[2  » (lT-2)  y*]  > 0. 
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The  left  member  vanishes  at  y = 0 and  has  a negative  derivative  for  0 < y^  $ 1. 
Therefore,  there  is  no  negative  minimum  in  (0,1),  and  from  the  previous 
argument  k £ 0,  which  completes  the  proof. 

Since  for  any  fixed  r*  is  a better  estimate  when  cu»0,  it  will  be 
useful  to  have  something  simpler  in  the  way  of  an  asymptotic  distribution  of 
r*  than  that  contained  in  Theorem  VI.  Ve  are  therefore  led  to 


Theorem  XI: 


When  to~0,  ve  have  to  a close  approximation 
tank"1  23L.  j{  ( tanir1 

jr  & *» 


Proofs 

V(r»|Orf>)  - 1 <f 4 - -Sr2  *■  f > - \ -r2)2  - 

Dropping  the  last  tern  and  solving  the  equation 


g'U) 


1 

(■£  - x2) 


we  have  g(x)  ~(2/V5)  tanlT1  (2x//j).  It  is  known  that 
/n"[g(r*)  - g(p)}  a;jT(0,l), 
so  the  theorem  is  proved. 


Discussion  of  Results  Concerning  r* 

In  looking  over  Theorems  V,  VI,  VII,  VIII,  Z,  and  XI,  several  facts 
stand  out.  First,  even  though  r*  is  consistent  and  asymptotically  normal, 
it  is  still  inadequate  for  estimating  f*  because  of  its  possible  magnitude 
and  its  lack  of  large  sample  efficiency  for  large  values  of  If/.  In  the 
case  of  testing  they  hypothesis  Hi  fsf«  the  first  defect  is  net  of  so 
much  consequence.  Even  in  a problem  of  estimation,  one  can  always  operate 
under  the  rule*  When  ]x*|  <1  estimate  p by  r*,  when  r*  £ 1 estimate  1, 
and  when  r*:£  -1  estimate  p=  -1.  The  gross  defect  is  lack  of  efficiency. 
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In  practically  all  applications  it  is  of  more  interest  to  detect  large 
values  of  I p I than  small  values.  In  just  such  cases  r*  is  a "worst"  esti- 
mator. On  the  other  hand,  again  speaking  in  large  sample  terms,  when  F * 0, 
x*  is  a "best"  estimator.  Hence,  if  we  base  a test  of  Hi  on  i*,  good 

results  should  be  achieved  when  I f'\  I is  small.  It  is  then  recommended 
that  i*  be  used  for  one  and  only  one  purpose,  to  test  Hi  P 0 P«  when  ! Po  I 
is  small.  If  in  addition  the  assumption  kJ  * 0 is  tenable,  then  the  vari- 
ance stabilizing  transformation  of  Theorem  XI  may  be  used,  calculations  being 
performed  with  Table  VB  of  Fisher  (p.210,  [4])*  In  such  a case  advantages 
of  the  type  discussed  by  Fisher  (pp.  197-204,  [4])  will  accrue.  /i T cr^ 
is  given  in  fable  I. 

In  the  case  of  the  problem  of  estimating  the  value  of  a particular  test 
item  for  predicting  student  performance,  W * 0 would  occur  when  the  question 
is  of  such  difficulty  that  the  average  student  would  have  probability  .50 
of  answering  it  correctly.  We  could  then  use  r*  and  tne  variance  stabilizing 
transformation  of  Theorem  XI  to  test  the  null  hypothesis  Hi  P » 0,  which  is 
the  hypothesis  that  the  question  doesn't  add  anything  to  the  predictive  value 
of  the  test.  The  acceptance  of  hypothesis  H doesn* t mean,  of  course,  that 
the  question  should  be  omitted.  It  Is  well  known  that  such  questions  have 
at  times  a useful  purpose.  Note  that  in  view  of  the  above  discussion  it 
would  be  wrong  to  use  r*  to  obtain  confidence  limits  for  p * Also  note  that 
according  to  Theorem  X a question  for  which  (*>  * 0 is  a desirable  one  to 
have. 

6.  Solution  of  the  Likelihood  Equations 
In  what  follows  all  summations  will  be  over  the  domain  i * l,2,...,n. 
From  (4*2)  it  may  be  seen  that  the  likelihood  equations  are 

y-  f lha t »i^(«i.*>)  { _ 

~ l (I-*j)  V (**,*)  T St$  J “ 0 * 


(6,1) 
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where  $ refers  to  differentiation  with  respect  to  to  or/5  . Recall 


from  (4. 11)  that, 


(6.2) 


CJOJ 


Also, 

(6.3) 


(1-r) 


,*■'>) 


'f(xi,U>)(xi~f>U>) 

(1-f2) 


By  tiie  use  of  (6.2)  and  (6.3)  equations  (6.1)  can  be  written  as 


E (xi-fo»)(2«1-l)  ^(2*rl)(jU&$)}  — 0, 


(6.4) 


(2^-1) ) $ ^( 2Sj-l)  (yy-£w*|)j  ss*  0 . 


Now  introduce  the  notation 


(6.5)  $1  = 2.t-l  , 4-  (W-fxjKl-f2)-*  , ^ , 

Ai  =^l(5*i- 

Rewriting  (6.4)  again,  in  the  new  notation,  we  have 


(6.6) 


z ^ = o.  z r**^-  o. 


Y?Z  . . ti.  . . .«•  ***  • 


*4 

Easy  differentiation  gives  ^(x)  ~ <4(x){4(*)  - xj.  Newton’s 

method  in  two  variables  gives  the  following  equations  in  ^ ^ and  4 f*  , 
where  ■oA)  = ^ f5  — P ^ and  ^ being  initial  guesses: 


(6.7) 


• JL 


^ + =-z  » 

u-fi2)* 


Let  A be  the  determinant  of  the  coefficients.  The  nethod  of  solution 
will  then  be  the  following: 

ij&tead  ai  sgi.uUgfl 

i)  Compute  (to*,  r*)  from  the  sample  (x^t  *j),  i = l,2,...,n, 
where  r*  is  the  sample  biserial  correlation  coefficient  and  “>  * is  tho 
solution  of  the  equation  p(  «>)  =•  "g.  Now,  let  — tJj*  ancl 


' r* 

when 

l**l  < 1 

Pi 

= . 

+.90 

when 

r*  £ 1 

-.90 

L 

when 

x*  ^ -1. 

ii)  Compute  , 

■y* 

Ji» 

h. 

Lxi^i>  ^i>  ^ixi*  ^lxl  ^'or  ^ “ a»2,.., n. 

where  are  defined  in  (6.5),  and  Table  II  is  used  to 

obtain  numerical  values  of  the  4*  i* 


iii)  Evaluate  the  three  determinants 
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A = 


E.  AixjL  E A1xi 


a-ii2) 


5 # 


AcJ  = 


-e  ft* iSiAj.  -eia 

- E ft"!  E AlX±  - E A^2 

E a . -E  *1^1 

E Aixi  ~E 


iv)  Obtain  ( w,  p ) from  (^<*>,  ap)  and  ( p^),  and  repeat 

the  process  using  <*-  ~ (*j^,  p *=  p2  in  place  of  ^ and  p^. 

The  rule  given  in  i)  is  somewhat  arbitrary,  but  is  believed  to  be  a 
good  rule  of  thumb.  The  longest  stage  in  the  scheme  outlined  above  is  the 
determination  of  i — l,2,...,n,  from  Table  II. 

Ve  shall  now  present  an  illustration  of  the  method.  In  order  to  have 
a good  vantage  point  for  observing  the  way  the  calculations  run,  ve  select 
a random  sample  from  C>^(0,0;l,i;i//2) . A table  of  random  numbers  from 
such  a population  is  not  available  directly,  but  can  be  constructed  from  a 
table  of  {andom  numbers  from  ff( 0,1)  as  follows!  Let 


U * sf :0,1)  , V«  ^to,l)  , 6J*.50. 

oov,  let 

x*u  , ^LL|JL  . 

Now  dichotomise  Y by  introducing  the  Z variable  i 

Z • 1 if  I * .50  and  Z * 0 if  Y < .50. 
The  computing  scheme  for  20  pairs  of  observations  follows i 
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A second  iteration  resulted  in  w 0.251,  f j°  0.489.  Since 
remained  unchanged  in  the  third  place,  the  results  were  not  included.  Recall 
that  the  true  value  of  f*  is  .707.  On  the  basis  of  our  sample  of  20, 

P s .489  is  the  beet  we  can  do.  However,  by  using  the  iterative  scheme 
instead  of  r*  we  removed  27£  of  the  error. 


7.  £ynB&£X 

The  problem  of  biserial  correlation  it  examined.  An  attempt  is  made 
to  touch  upon  all  aspects  of  the  problem,  without  sacrificing  mathematical 
rigor,  and  to  describe  the  pertinent  literature  in  its  proper  setting. 
Particular  attention  is  paid  to  the  use  of  maximum  likelihood,  and  to  the 
asymptotic  efficiency  of  the  sample  biserial  correlation  coefficient. 

Results  may  be  sunmarised  as  follows. 

(1)  The  likelihood  equations  for  «>  , the  point  of  dichotomy,  and  P , 
the  population  correlation  coefficient,  are  obtained. 

A method  for  their  solution  is  described  and  illustrated  by  an  example. 
Detailed  calculations  are  given. 

(2)  Asymptotic  variances  are  derived  for  the  maximum  likelihood  esti- 

A.  A 

mators,  <0  and  f , and  are  found  to  coincide  with  expressions  given  by 
Marita  £5]. 

(3)  The  sample  biserial  correlation  coefficient  (bisorial  r)  is  shown 
to  be  appropriate  and  very  useful  for  certain  problems  in  testing  hypotheses, 
but  essentially  worthless  in  other  situations.  Several  results  are  given 

in  reference  to  the  limiting  distribution  and  asymptotic  efficiency  of  this 
coefficient. 

(4)  Tables  are  given  for  the  asymptotic  standard  deviation  of  the 
sample  biserial  correlation  coefficient  and  for  Mills'  ratio,  the  latter 
being  useful  in  solving  the  likelihood  equations . 

(5)  Practical  suggestions  are  offered,  for  application  of  the  results 
of  the  paper,  wherever  possible. 


TABLE  I 


The  Asymptotic  Standard  Daria ti  on  of  i**’  (bisarial  r) 
as  a Function  of  p and^7. 

All  values  must  be  divided  by  /IT 


p or  1-p 


■ 

.05 

.10 

.15 

.20 

.25 

.30 

.35 

.40 

.45 

.50 

D 

4.466 

2.922 

m 

2.0a 

1.857 

1.737 

1.658 

1.608 

1.580 

1.571 

.10 

rm 

1.699 

1.521 

1.419 

1.353 

1.308 

1.278 

1.258 

1.247 

1.243 

.20 

(ggj 

1.668 

1.491 

1.389 

1.323 

1.279 

1.248 

ggg 

1.217 

1.213 

.30 

2.033 

1.616 

BES 

1.339 

1.273 

1.229 

1.198 

1.179 

1.167 

1.163 

.40 

1.971 

1.543 

1.370 

m 

1.203 

1.159 

1.128 

1.109 

1.097 

1.093 

.50 

1.093 

1.449 

1.279 

1.179 

1.114 

1.069 

1.038 

BESS 

1.008 

1.004 

.60 

1.799 

1.333 

1.167 

1.069 

1.004 

0.960 

0.930 

0.910 

0.898 

0.894 

.70 

1.691 

1.194 

tm 

0.939 

0.875 

0.831 

BESS 

0.781 

0.769 

0.766 

lea 

1.569 

1.031 

0.881 

0.789 

0.727 

0.683 

0.653 

0.632 

0.620 

0.616 

.90 

1.430 

0.842 

0.705 

0.619 

0.559 

0.517 

0.486 

0.465 

0.453 

0.449 

1.00 

1.302 

0.616 

0.503 

0.429 

0.374 

0.335 

0.304 

0.283 

0.270 

0.266 

'...SPt 
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Mills'  Hatio1*2 


-X 

*(*) 

-X 

4(x) 

.00 

.79788 

.40 

.56188 

.01 

.79152 

.41 

.55649 

•02 

.78519 

.42 

.55112 

.03 

.77887 

.43 

.54578 

.04 

.77259 

.44 

.54047 

.05 

.76632 

.45 

.53520 

.06 

.76008 

.46 

.52993 

.07 

.75387 

.47 

.52471 

•08 

.74767 

•48 

.51948 

.09 

.74U8 

.49 

.51431 

.10 

.73532 

.50 

.50917 

.11 

.72920 

.51 

.50404 

.12 

.72309 

.52 

.49893 

.13 

.71701 

.53 

.49387 

.14 

.71094 

154 

.48883 

.15 

.70491 

.55 

.48380 

.16 

.69890 

.56 

.47883 

.17 

.69291 

.57 

.47386 

.18 

.68694 

.58 

.46893 

.19 

.68099 

.59 

.46402 

.20 

.67507 

.60 

.45914 

.21 

•66917 

.61 

.45429 

.22 

.66331 

.62 

.44947 

.23 

.65747 

.63 

.44468 

.24 

.65165 

.64 

.43992 

.25 

.64584 

.65 

.43518 

.26 

.64006 

.66 

.43047 

.27 

.63431 

.67 

.42580 

.28 

.62860 

.68 

.42114 

.29 

.62289 

.69 

.41652 

.30 

.61723 

.70 

.41192 

.31 

.61158 

.71 

.40736 

.32 

.60594 

.72 

•40282 

.33 

.60035 

.73 

.39832 

.59478 

.84 

.39383 

.35 

.58923 

.75 

.38939 

.36 

.58371 

.76 

.38496 

,.37 

.57822 

.77 

.38056 

.38 

.57274 

.78 

.37621 

.39 

.56731 

.79 

.37186 

si 

a. 


1 Mills'  ratio:  <j>(x)  - 


/5i t 


Jtjt 

.V  • . . 


t zr 


4* 


2 This  table  is  rep&duced  with  ths  kind  permission  of 
Professor  Z.  V.  Blrnbaua  of  the  Laboratory  of  Statistical 
Besearoh,  University  of  Washington. 


<(4 


-X 

hx) 

-X 

/><x) 

.30 

.36756 

1.30 

.18974 

.81 

.36329 

1.31 

,18693 

.82 

-35904 

1.32 

.18414 

.83 

.35481 

1.33 

.18138 

.84 

.35062 

1.34 

.17866 

.85 

.34646 

1.35 

.17595 

.86 

.34234 

1.36 

.17328 

.87 

.33823 

1.37 

.17064 

.88 

.33416 

1.38 

.16803 

.89 

.33012 

1.39 

.16544 

.90 

.32611 

1.40 

.16286 

.91 

.32213 

1.41 

.16035 

.92 

.31818 

1.42 

.15784 

.93 

.3U25 

1.43 

.15536 

.94 

.31035 

1.44 

.15292 

.95 

.30649 

1.45 

.15050 

.96 

.30264 

1.46 

.14810 

.97 

.29884 

1.47 

.14574 

.98 

.29506 

1.48 

.14340 

.99 

.29132 

1.49 

.14108 

1.00 

,28760 

1.50 

.13879 

1.01 

,28391 

1.51 

.13653 

1.02 

,28025 

1.52 

.13429 

1.03 

,27662 

1.53 

.13208 

1.04 

,27303 

1.54 

.12991 

1.05 

.26945 

1.55 

.12775 

1.06 

26591 

1.56 

.12562 

1.07 

.26240 

1.57 

.12351 

1.08 

.25892 

1.58 

.12143 

1.09 

.25547 

• 1.59 

.11938 

1.10 

.25204 

1.60 

.11735 

1.11 

.24866 

1.61 

.11534 

1.12 

.24529 

1.62 

.11338 

1.13 

.24196 

1.63 

.11141 

1.14 

,23865 

1.64 

.10949 

1.15 

.23538 

1.65 

.10758 

1.16 

.23213 

1.66 

.10571 

1.17 

.22891 

1.67 

.10386 

1.18 

.22572 

1.68 

.10202 

1.19 

.22256 

1.69 

.10022 

1.20 

.21944 

1.70 

.09844 

1.21 

.21634 

1.71 

.09668 

1.22 

.21326 

1.72 

.09495 

1.23 

.21023 

1.73 

.09323 

1.24 

.20721 

1.74 

.09155 

1.25 

.20423 

1.75 

.08988 

1.26 

.20127 

1.76 

.08824 

1.27 

.19834 

1.77 

.08661 

1.28 

.19545 

1.78 

.08502 

1.29 

.19257 

1.79 

.08344 

-X 

P(x) 

1.80 

.08189 

1.81 

.08036 

1.82 

.07885 

1.83 

.07737 

1.84 

.07591 

1.85 

.07445 

1.86 

.07304 

1.87 

.07163 

1.88 

.07025 

1.89 

.06889 

1.90 

.06756 

1.91 

.06625 

1.92 

.06494 

1.93 

.06366 

1.94 

.06240 

1.95 

.06115 

1.96 

.05994 

1.97 

.05873 

1.98 

.05755 

1.99 

.05639 

2.00 

.05525 

2.01 

.05412 

2.02 

.05301 

2.03 

.05192 

2.04 

.05085 

2.05 

.04979 

2.06 

.04876 

2.07 

.04774 

2.08 

.04674 

2.09 

.04575 

2.10 

.04478 

2.11 

.04383 

2.12 

.04290 

2.13 

.04198 

2.14 

.04107 

2.15 

.04018 

2.16 

.03932 

2.17 

.03846 

2.18 

.03761 

2.19 

.03678 

2.20 

.03597 

2.21 

.03518 

2.22 

.03439 

2.23 

.03362 

2.24 

.03287 

2.25 

.03213 

2.26 

.03140 

2.27 

.03070 

2.28 

.02999 

2.29 

.02930 

-X 

<£(*) 

2.30 

.02864 

2.31 

.02997 

2.32 

.02733 

2.33 

.02669 

2.34 

.02607 

2.35 

.02546 

2.36 

.02486 

2.37 

.02428 

2.38 

.02370 

2.39 

.02313 

2.40 

.02258 

2.41 

.02204 

2.42 

.02151 

2.43 

.02099 

2.44 

.02048 

2.45 

.01998 

2.46 

.01950 

2.47 

.01902 

2.48 

.01854 

2.49 

.01809 

2.50 

.01764 

2.51 

.01719 

2.52 

.01677 

2.53 

.01634 

2.54 

.01594 

2.55 

.01553 

2.56 

.01514 

2.57 

.01475 

2.58 

.01438 

2.59 

.01401 

2.60 

.01364 

2.61 

.01329 

2.62 

.01295 

2.63 

.01261 

2.64 

.01228 

2.65 

.01196 

2.66 

.01165 

2.67 

.01134 

2.68 

.01104 

2 .6? 

.01075 

2.70 

.01046 

2.71 

.01017 

2.72 

.00990 

2.73 

.00964 

2.74 

.00938 

2.75 

.00912 

2.76 

.00888 

2.77 

.00863 

2.78 

.00839 

2.79 

.00816 

' ..  ’TTSSffiLt'M  ... 


-X 

<?<*> 

2.80 

.00794 

2.83. 

.00772 

2.82 

.00750 

2.83 

.00729 

2.84 

.00709 

2.85 

.00689 

2.86 

.00669 

2.87 

.00650 

2.88 

.00632 

2.89 

.00614 

2.90 

.00596 

2.91 

.00579 

2.92 

.00563 

2.93 

.00546 

2.94 

.00531 

2.95 

.00515 

2.96 

.00500 

2.97 

.00486 

2.98 

.00472 

2.99 

.00458 

3.00 

.00444 

-JEM SM 


X 

^(x) 

X 

^>(x) 

.00 

.79788 

.50 

1.1410 

.01 

.80426 

.51 

1.1484 

.02 

.81066 

.52 

1.1557 

.03 

.81708 

.53 

1.1631 

.04 

. 82351 

.54 

1.1704 

.05 

,82998 

.55 

1.1779 

.06 

.83646 

.56 

1.1854 

.07 

.84298 

.57 

1.1926 

.08 

.84950 

.58 

1.2000 

.09 

.85605 

.59 

1.2076 

.10 

.86262 

1.2151 

.11 

.86923 

.61 

1.2225 

.12 

.87582 

.62 

1.2300 

.13 

.88246 

.63 

1.2375 

.14 

.88909 

.64 

1.2450 

.15 

.89577 

.65 

1.2525 

.16 

.90246 

.66 

1.2601 

.17 

.90916 

.67 

1.2677 

.18 

.91589 

.68 

1.2753 

.19 

.92266 

.69 

1.2829 

.20 

.92941 

.70 

1.2905 

.21 

.93621 

.71 

1.2982 

.22 

.94300 

.72 

1.3058 

.23 

.94984 

.73 

1.3134 

.24 

.95668 

.74 

1.3212 

.25 

.96357 

.75 

1.3287 

.26 

.97043 

.76 

1.3364 

.27 

.97734 

.77 

1.3441 

.28 

.98427 

.78 

1.3519 

.29 

.99119 

.79 

1.3596 

.30 

.99816 

.80 

1.3674 

.31 

1.00516 

.81 

1.3751 

.32 

1,01215 

.82 

1.3829 

.33 

1.0192 

.83 

1.3906 

.34 

1.0262 

.84 

1.3986 

.35 

1.0333 

.85 

1.4063 

.36 

1.0404 

.86 

1.4U2 

.37 

1.0474 

.87 

1.4221 

.38 

1.0545 

.88 

1.4298 

.39 

1.0616 

.89 

1.4378 

.40 

1.0687 

.90 

1.4457 

.41 

1.0760 

.91 

1.4535 

.42 

1.0831 

.92 

1.4613 

.43 

1.0903 

.93 

1.4693 

.44 

1.0975 

.94 

1.4773 

.45 

1.1047 

.95 

1.4852 

.46 

1.1120 

.96 

1.4932 

.47 

1.1193 

.97 

1.5013 

•48 

1.1265 

.98 

1.5092 

.49 

1.1338 

.99 

1.5170 

r4 


50 

1.9387 

51 

1.9470 

52 

1.9554 

53 

1.9643 

54 

1.9728 

55 

1.9814 

56 

1.9904 

57 

1.9984 

58 

2.0068 

59 

2.0153 

60 

2.0243 

61 

2.0325 

62 

2.0412 

63 

2.0500 

64 

2.0585 

65 

2.0670 

66 

2.0756 

,67 

2.0846 

68 

2.0929 

,69 

2.1022 

.70 

2.1102 

.71 

2.1191 

.72 

2.1277 

.73 

2.1358 

.74 

2.1450 

.75 

2.1538 

.76 

2.1626 

.77 

2.1711 

.78 

2.1796 

.79 

2.1882 

.80 

2.1973 

.81 

2.2060 

.82 

2.2148 

.83 

2.2242 

.84 

2.2326 

.85 

2.2406 

.86 

2.2502 

.87 

2.2589 

.88 

2.2676 

.89 

2.2758 

.90 

2.2847 

.91 

2.2941 

.92 

2.3026 

.93 

2.3116 

.94 

2.3202 

.95 

2.3288 

.96 

2.3375 

.97 

2.3463 

.98 

2.3557 

.99 

2.3641 

36 


X 

</{*> 

X 

$**(*) 

2.00 

2.3730 

2.50 

2.8233 

2.01 

2.3815 

<2.51 

2.8297o 

2.02 

2.3912 

2.52 

2.8401 

2.03 

2.3992 

. 2.53 

2.8506 

2.04 

2.4079  * 

2.54 

2.8612 

2.05 

2.4178 

2.55 

• 2.8662 

2.06 

2.4266 

2.56 

2.8794 

2.07 

2.4349 

2.57 

2.8802 

2.08 

2.444* 

2.58 

2.8969 

2.09 

2.4528 

2.59 

2.9044 

2.10 

2.4624 

2.60 

2.9138 

2.11 

2.4710 

2.61 

2.9206 

2.12 

2.4808 

2.62 

2.9300 

2.13 

2.4882 

2.63 

2.9412 

2.14 

2.4975 

2.64 

2.9472 

2.15 

2.5063 

2.65 

2.9630 

2.16 

2.5151 

2.66 

2.9665 

2.17 

2.5253 

2.67 

2.9815 

2.18 

2.5329 

2.68 

2.9895 

2.19 

2.5426 

2.69 

3.0003 

2.20 

2.5517 

2.70 

3.0030 

2.21 

2.5608 

2.71 

3.0175 

2.22 

2.5694 

2.72 

3.0276 

2.23 

2.5786 

2.73 

3.0312 

2.24 

2.5867 

2.74 

3.0460 

2.25 

2.5975 

2.75 

3.0506 

2.26 

2.  6055 

2.76 

3.0618 

2.27 

2.6157 

2.77 

3.0750 

2.28 

2.6240 

2.78 

3.0769 

2.29 

2.6323 

2.79 

3.0836 

2.30 

2.6427 

2.80 

3.0941 

2.31 

2.6511 

2.81 

3.1046 

2.32 

2.6596 

2.82 

3.U62 

2.33 

2.6695 

2.83 

3.1201 

2.34 

2.6781 

2.84 

3.1279 

2.35 

2.6860 

2.85 

3.1368 

2.36 

2.6947 

2.86 

3.1506 

2.37 

2.7064 

2.87 

3.1656 

2.38 

2.7122 

2.88 

3.1706 

2.39 

2.7248 

2.89 

3.1766 

2.40 

2.7307 

2.90 

3.1817 

2.41 

2.7390 

2.91 

3.1939 

2.42 

2.7503 

2.92 

3.2113 

2.43 

2.7586 

2.93 

3.2248 

2.44 

2.7701 

2.94 

3.2321 

2.45 

2.7786 

2.95 

3.2331 

2.46 

2.7855 

2.96 

3.2404 

2.47 

2.7941 

2.97 

3.2552 

2.48 

2.8035 

2.98 

3.2712 

2.49 

2.8121 

2.99 

3.2873 

3.00 

3.2819 
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